Building a Rich Large-scale Lexical Base for Generation
نویسنده
چکیده
Most large lexical resources have been developed with language interpretation in mind and can not be used directly for generation. We present a rich large-scale lexical base for generation, constructed by merging various linguistic resources. Our approach meets the needs of language generation systems by providing the facilities for mapping from semantic concepts to verb/sense pairs, for identifying the valid subcategorization forms for a given verb sense, and for representing alternations for paraphrasing power. Information from diierent resources enriches and constrains each other, so the nal result is complete as well as accurate. We show by example how this lexical base can be intergrated into a generation package and how it simpliies development process while improving system performance.
منابع مشابه
Combining Multiple, Large-Scale Resources in a Reusable Lexicon for Natural Language Generation
A lexicon is an essential component in a generation system but few efforts have been made to build a rich, large-scale lexicon and make it reusable for different generation applications. In this paper, we describe our work to build such a lexicon by combining multiple, heterogeneous linguistic resources which have been developed for other purposes. Novel transformation and integration of resour...
متن کاملControlling The Application Of Lexical Rules
In this paper, we describe an item-familiarity account of the semi-productivity of morphological and lexical rules, and illustrate how it can be applied to practical issues which arise when building large scale lexical knowledge bases which utilize lexical rules. Our approach assumes that attested uses of derived words and senses are explicitly recorded, but that productive use of lexical rules...
متن کاملIntegrating a Large-Scale, Reusable Lexicon with a Natural Language Generator
This paper presents the integration of a largescale, reusable lexicon for generation with the FUF/SURGE unification-based syntactic realizer. The lexicon was combined from multiple existing resources in a semi-automatic process. The integration is a multi-step unification process. This integration allows the reuse of lexical, syntactic, and semantic knowledge encoded in the lexicon in the devel...
متن کاملCombining Dictionary-Based and Example-Based Methods for Natural Language Analysis
We propose combining dictionary-based and example-based natural language (NL) processing techniques in a framework that we believe will provide substantive enhancements to NL analysis systems. The centerpiece of this framework is a relatively large-scale lexical knowledge base that we have constructed automatically from an online version of Longman's Dictionary of Contemporary English (LDOCE), ...
متن کاملRobust Natural Language Generation from Large-Scale Knowledge Bases
We have begun to see the emergence of large-scale knowledge bases that house tens of thousands of facts encoded in expressive representational languages. The richness of these representations o er the promise of signi cantly improving the quality of natural language generation, but their representational complexity, scale, and task-independence pose great challenges to generators. We have desig...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997